Computational auditory scene analysis: A representational approach
نویسندگان
چکیده
منابع مشابه
Auditory Scene Analysis: Computational Models
Listeners have to make sense of a complex acoustic world containing overlapping sound sources that must be organized into individual auditory objects. Computational auditory scene analysis concerns the use of algorithms inspired by human sound perception whose aim is to extract properties of constituent sound sources in a complexmixture. Starting with representations based on models of how soun...
متن کاملAuditory Scene Analysis: Computational Models
Human listeners have a remarkable ability to separate a complex mixture of sounds into discrete sources. The processes underlying this ability have been termed ‘auditory scene analysis’ (Bregman 1990; this volume). Recently, an interdisciplinary field known as ‘computational auditory scene analysis’ (CASA) has emerged which aims to develop computer systems that mimic this aspect of hearing (Ros...
متن کاملA computational model of auditory scene analysis
Various grouping attributes have been translated into successful signal processing techniques that may be used in source separation, e.g., to separate speech from background. However, separation is not enough to know: What is the source of the sound? A next step beyond primitive ASA is schema-based ASA, to give meaning to the source, i.e. to map bottom-up audio features to the meaningful conten...
متن کاملPrediction-driven computational auditory scene analysis
The sound of a busy environment, such as a city street, gives rise to a perception of numerous distinct events in a human listener – the ‘auditory scene analysis’ of the acoustic information. Recent advances in the understanding of this process from experimental psychoacoustics have led to several efforts to build a computer model capable of the same function. This work is known as ‘computation...
متن کاملA computational auditory scene analysis system for robust speech recognition
We present a computational auditory scene analysis system for separating and recognizing target speech in the presence of competing speech or noise. We estimate, in two stages, the ideal binary time-frequency (T-F) mask which retains the mixture in a local TF unit if and only if the target is stronger than the interference within the unit. In the first stage, we use harmonicity to segregate the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Journal of the Acoustical Society of America
سال: 1993
ISSN: 0001-4966
DOI: 10.1121/1.407441